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^ . We demonstrate the use of a multidimensional extension of the latent Markov model 

to analyse data from studies with correlated binary responses in developmental psychol- 



ogy. In particular, we consider an experiment based on a battery of tests which was 
administered to pre-school children, at three time periods, in order to measure their in- 
hibitory control and attentional flexibility abilities. Our model represents these abilities 
by two latent traits which are associated to each state of a latent Markov chain. The 

o; 

conditional distribution of the tests outcomes given the latent process depends on these 



abilities through a multidimensional two-parameter logistic parameterisation. We outline 



^ . an EM algorithm to conduct likelihood inference on the model parameters; we also focus 

on likelihood ratio testing of hypotheses on the dimensionality of the model and on the 
transition matrices of the latent process. Through the approach based on the proposed 
model, we find evidence that supports that inhibitory control and attentional flexibility 
can be conceptualised as distinct constructs. Furthermore, we outline developmental as- 
pects of participants' performance on these abilities based on inspection of the estimated 
transition matrices. 
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1 Introduction 



A fundamental scientific challenge in neuropsychology and developmental psychology is the 
search for evidence that two or more postulated mechanisms, thought to underlie the perfor- 
mance of participants on a set of tests, are separable. For example, Donohoe et al. (2006) and 
Kimberg and Farah (2000) provide an instance where two studies yield contradicting conclu- 
sions regarding the separability of the psychological constructs inhibitory control and working 
memory. Whilst Donohoe et al. 's findings suggest that inhibitory control is a construct that 
overlaps with, but is separate to working memory, Kimberg and Farah argue against the theory 
that there is an inhibitory mechanism above and beyond working memory. 

The characterisation of the latent trait underlying the performance of participants on a set 
of tests in terms of a single unidimensional latent variable would imply that all items measure a 
common construct. This model can be contested by specifying a multivariate latent trait which 
would postulate that the mechanisms underlying performance are distinct theoretical concepts. 

Key changes in cognitive development take place during childhood. In this paper we assess 
the separability of the executive functions inhibitory control and attentional flexibility in the 
context of a study involving young children. Longitudinal studies that aim to study develop- 
mental aspects of cognition typically involve the administration of several tasks, to participants, 
where each task is presented in blocks of trials during a single session. This session is subse- 
quently replicated over fixed periods of time, for example every 6 months, when it is believed 
that key changes in child cognition might have occurred. A common protocol approach involves 
the recording of participants' success or failure on every single trial performed during the length 
of the study. This yields a sequence of correlated binary responses for each participant. 

In the presence of dichotomously-scored items, the problem above may be translated into 
that of assessing the dimensionality of an Item Response Theory (IRT) model, such as the 
Rasch model (Rasch, 1961), also known as the one-parameter (1PL) logistic model, or the two- 
parameter logistic (2PL) model (Birnbaum, 1968). This problem has been debated since long 
time in the statistical and psychometric literature. Relevant contributions in this sense are 
those of Martin-L6f (1973), van den Wollenberg (1982a,b), Glas and Verhelst (1995), Verhelst 
(2001), Christensen et al. (2002) and, more recently, Bartolucci (2007). In the latter paper, 
in particular, a class of multidimensional IRT model is proposed which is based on a 2PL 
parameterisation of the probability of responding correctly to each item and on the assumption 
that the latent traits follow a multivariate discrete distribution with an arbitrary number of 



support points. Bartolucci (2007) showed how to test for undimensionality by a Wald statistic, 
or equivalently a likelihood ratio (LR) statistic between a bidimensional and a unidimensional 
model, which are formulated on the basis of the above assumptions. He also showed how this 
procedure may be extended to assess the number of distinct latent traits measured by the test 
items and, consequently, to cluster these items in homogeneous groups. 

The above approaches, and in particular that of Bartolucci (2007), can be directly applied to 
assess the dimensionality of the psychological response process described above, provided that 
responses are collected at a single occasion and that the ability of each subject in responding 
correctly to each item remain constant during the test. This excludes the cases in which the 
same set of items is repeatedly administered to the same subjects during a single testing session, 
which is subsequently replicated at a certain number of occasions separated by a suitable interval 
of time. This scheme is very common in psychological applications as the one which motivates 
this paper. The particular feature of this scheme is that a subject may evolve in his/her latent 
characteristics between occasions and this is not taken into account in the multidimensional 
IRT models mentioned above. This evolution in time is typically due to tiring effects, learning- 
through-training and other developmental phenomena. 

In this paper, we propose a multidimensional model for the analysis of data deriving from 
design protocols that use a repeated measurement scheme at each occasion of a longitudinal 
study. The basic tool is the latent Markov (LM) model of Wiggins (1973), which may be 
seen as an extension for longitudinal data of the latent class model Lazarsfeld and Henry 
(1968) in which each subject is allowed to move between latent classes during the period of 
observation. For a detailed description of the LM model see Langeheine and van de Pol (2002) 
and Bartolucci (2006); for a description with an IRT perspective see Bartolucci et al. (2008). 
In our formulation: (i) the response variables are conditionally independent given a sequence 
of latent variables that follows a first-order Markov chain which is partially homogeneous; (ii) 
each state of the chain is associated to a certain level of each latent trait measured by the test 
items; (iii) the distribution of each response variable depends on these latent states through 
the same 2PL parameterisation adopted by Bartolucci (2007). Evolution of the subjects with 
respect to these latent traits depends on the transition matrices of the model and, on the 
basis of these matrices, we can test the hypothesis that the subjects always remain in the 
same state. Obviously, by exploiting the proposed model, we can also perform an LR test for 
the hypothesis of unidimensionality against that of multidimensionality. In contrast to more 
standard approaches, this test takes into account the longitudinal structure of the data. We 



also allow for the inclusion of covariates affecting the probability to belong to each latent state 
by a parameterisation similar to that of Vermunt et al. (1999). 

For the maximum likelihood (ML) estimation of the LM model, we illustrate an EM algo- 
rithm (Dempster et al., 1977) which closely recalls that illustrated by Bartolucci (2006). At 
the E-step, the algorithm exploits certain recursions developed within the hidden Markov lit- 
erature (MacDonald and Zucchini, 1997). We also deal with the asymptotic distribution of the 
LR statistic for testing hypotheses on the transition probabilities. Note that, under hypotheses 
of this type, the parameters may be on the boundary of the parameter space and, therefore, an 
inferential problem under non-standard conditions arises (Self and Liang, 1987; Silvapulle and 
Sen, 2004). However, as shown by Bartolucci (2006), this LR test statistic has an asymptotic 
chi-bar-squared distribution under the null hypothesis. 

The remainder of this paper is organised as follows. In the next Section we describe the 
psychological study that motivates this paper. In Section 3 we illustrate the main assumptions 
of the proposed multidimensional LM model and in Section 4 we illustrate likelihood inference 
on this model, including testing hypotheses on the dimension of the model and on the transition 
matrices of the latent process. Results of the application of our modelling approach to the study 
on executive function are presented in Section 5. We close with a final discussion in Section 6. 

2 A longitudinal study on executive function 

The study conducted by Shimmon (2004) aims to investigate theoretical questions regarding 
the relationships between the components of the cognitive construct executive function in young 
children. The study comprises data collected from the administration of a battery of tests to 
115 children during a single testing session. This session was subsequently replicated over two 
6-month periods when it was believed that key changes in child cognition might have occurred. 
At the first period of testing the participants' age range was 34-55 months. In this paper 
we restrict our attention to two components of executive function, namely inhibitory control 
and attentional flexibility. These are two abstract concepts that define two closely related 
psychological constructs. We address two methodological issues: (i) we investigate the nature 
of the interrelationship between the two constructs and (ii) we assess developmental trends in 
task performance and scalability of the tasks at various ages. 

The response data collected for each participant consist of a sequence of correlated binary 
outcomes with each component indicating a success or failure for each trial on four tasks ad- 



ministered at each of three time periods. Inhibitory control was measured by the day/night and 
the abstract pattern tasks developed by Gerstadt et al. (1994); each of these tasks was admin- 
istered in blocks of 16 trials. Similarly, attentional flexibility was measured by two versions of 
the Dimensional change card-sort (DCCS) tasks: the DCCS face-up and the DCCS face-down 
tests (Zelazo et al., 1996). Each of these tests was administered in blocks of 6 trials. Partic- 
ipants were randomly allocated to one of two testing orderings. Half of the group performed 
the tasks in the following order: day/night, DCCS face-down, abstract pattern, DCCS face- 
up, whereas the other half of the group followed the order: abstract pattern, DCCS face-up, 
day/night, DCCS face-down. Thus the length of the response sequence for each participant is 
44 (16+6+16+6) at each time period. The maximum number of observations of each subject 
is then 3 x 44 = 132. For an illustration of this experiment see Figure 1. 
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Figure 1: Experimental stages of participants' performance. The matrices Yl r , r = 
contain the transition probabilities which are estimated within the proposed approach. 
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Note that the abstract pattern task is expected by design to demand less inhibitory control 
skill than the day/night task. Similarly, the face-up version of the DCCS task has lower working 
memory demands than the face-down version. The rationale of the experimental design was to 
administer the harder tasks previous to their easier versions to half of the participants, and the 
reverse order to the other half in order to control for potential testing ordering effects. 



3 The multidimensional latent Markov model 

Let n denote the sample size, let Ti, i — 1, . . . , n, denote the number of tasks administered to 
subject i and let Y i = (Yq, . . . , Y^) denote the corresponding vector of response variables, with 
Y it being a random variable equal to 1 if subject % performs correctly the task administered 
at occasion t and otherwise. From the description in the previous section it is clear that 
the subjects in the sample are administered the tasks in a different order, depending on the 
experimental design which is randomly adopted for each of them. To take this into account we 
introduce the index j it , i = 1, . . . , n, t = 1, . . . , T iy for the type of task administered to subject 
% at occasion t. In practice, jn G J = {1, . . . , J} for every i and t, with J denoting the number 
of different types of task. In our application J = 4, with ju equal to 1 for day/night task, 2 for 
abstract pattern, 3 for DCCS face-down and 4 for DCCS face-up. 

3.1 Basic assumptions 

The proposed model is based on the assumption that the tasks under consideration measure s 
different types of latent traits or, more specifically, abilities; then, we denote by J ^ d = 1, . . . , s, 
the subsets of J containing indices of the tasks measuring ability of type d. In our application, 
for instance, it is reasonable to set s = 2, with J \ = {1,2} and J^2 = {3, 4}, so that day/night 
and abstract pattern tasks measure the ability of type 1 and DCCS tasks measures the ability 
of type 2 in both face-down and face-up versions. Consequently to each subject i is associated 
an s-dimensional vector of abilities which is allowed to depend on time and is denoted by 
&it — {Giti-, ■ ■ ■ , Gits)'- We also denote by Xj(O it ) the probability that this subject performs 
correctly a task of type j at occasion t. 

Following Bartolucci (2007), we adopt a 2PL parameterisation (Birnbaum, 1968) for this 
conditional probability, so that 



where 7j is the discriminant index for the jth item and 5jd is a dummy variable equal to 1 
if j G 3d and to otherwise. With this notation, and taking into account that each subject 
faces a specific sequence of tasks, the probability that subject i responds yu to the t-th task 
administered is 





d 




(2) 



Obviously, the simpler one-parameter (1PL) logistic parameterisation (Rasch, 1961) could also 
be used. In this case, we have 

logit[Aj(0 it )] = ^ bjdOitd ~ 13 j, j = 1, • • • , J- (3) 

d 

This parameterisation is less flexible than the above one since it assumes that all taks have the 
same discriminant power, i.e. "fj — 1, j — 1, . . . , J. On the other hand, the resulting model is 
simpler to estimate. 

In order to model the dynamics of each ability across t, we assume that for each subject i 
the latent process On, ... , 0^ follows a first-order Markov chain with state space . . . , 
where k is the number of latent states, with initial probabilities 7r ic = p(0n = £ c ), c = 1, . . . , k, 
and transition probabilities 7r^ = p(0u = $, d \0i^-i = £ c ), c,d— l,...,k. Since we consider the 
abilities as the only explanatory variables of the probability of performing correctly an item, 
as in the latent Markov model of Wiggins (1973) we assume that the response variables in Vj 
are conditional independent given On, ... , 0%t v The resulting model is then an extension of the 
latent Markov Rasch model described by Bartolucci et al. (2008) based on a multidimensional 
2PL parameterisation for the conditional distribution of the response variables given the latent 
process and allowing for different sequences of items between subjects. Therefore, experimental 
design aspects such as testing order can also be taken into consideration in the statistical 
analysis of the data. Moreover, by assuming the existence of a latent Markov chain, we allow a 
subject to move between latent classes in a way that depends on the transition probabilities and 
hence we also generalise the multidimensional IRT model of Bartolucci (2007) which implicitly 
assumes that every subject maintains the same level of each ability across time occasions. 

A final point concerns how to model in a parsimonious way the initial and transition prob- 
abilities of the latent process taking into account the difference between subjects in terms of 
individual covariates and considering the experimental design. For this aim, and similarly to 
Vermunt et al. (1999), we adopt the following logistic parameterisation for the initial probabil- 
ities: 

log— = x'i(f) c , c=2,...,k, i = l,...,n, (4) 

where Xi is a vector of covariates measured at the first occasion. In our application, we allow 
these initial probabilities to depend on the age of the subject at beginning of the study, indicated 
by di for subject i, and then we let Xi = (1, a^)'. 

For what concerns the transition probabilities of the latent process, we specify those tran- 
sition matrices that encompass the different transitional stages of substantive interest. For 



instance, for our application, we use a first type of transition matrix within the sequence of 
day/night tasks and a second type within the second of abstract pattern tasks and so on; see 
Section 6 for a detailed description. Note that we could also allow the transition probabilities to 
depend on the individual covariates through a parameterisation of the probabilities irfj^ similar 
to that in (4). This however would complicate the approach and we do not pursue this further 
here. 

Submodels may also be considered. The most interesting is obviously that of unidimension- 
ality that may be formulated by requiring that £ c = £ c lfc, with 1/. denoting a column vector 
of k ones, so that the elements of £ c are equal to each other. The hypothesis that there is no 
transition between latent states at a certain occasion is also of interest. This may be formulated 
by requiring that the corresponding matrix of transition probabilities is diagonal; see Bartolucci 
(2006) for a detailed description of these constraints. Finally, the assumption that there is no 
effect of the covariates on the initial probabilities is also of interest. In our application this 
assumption may be formulated by letting 

3.2 Manifest distribution of the response variables 

In order to derive the manifest distribution of the response variables Yu, we consider the vector 
Ci = (Cji, . . . , CiT t ), with Cu being the state to which subject i belongs at occasion t, with 
% = 1, . . . , n, t = 1, . . . , Tj. In practice, Cu = c, for c = 1, . . . , k, if and only if On = x c . 

The assumption of conditional independence between the elements of Yi given the latent 
process implies that 

j 

where y i denotes a realisation of Yi, Ci denotes a realisation of Ci whose elements are indicated 
by c it and 

Ktc = p(Yit = Vit\0it = £ c )> ( 5 ) 

which is defined on the basis of (2). Consequently, the manifest distribution of Yi may be 
expressed as 

P(Vi) = s ^ J v{.y l \ci)p{.c i ), (6) 

where the sum is extended to all the possible k Ti configurations of Cj. Clearly, the probability 
p(ci) depends on the initial and transition probabilities 7Tj C and nfj d . 



Avoiding the sum in (6), the manifest probability p(y i ) may be efficiently computed by 
exploiting certain recursions known in the hidden Markov literature (MacDonald and Zucchini, 
1997) and which may be efficiently implemented by using matrix notation; we refer to Bartolucci 
et al. (2007) for details. Let 7Tj denote the initial probability vector with element 7Tj C , c = 
l,...,k, and let denote the transition probability matrix with elements c,d = 1, . . . , k, 
c 7^ d. For a fixed vector y { , also let In denote the column vector with elements p(6 it = £ c , Yn = 
yn, . . . , Y it = y it ) for c = 1, . . . , k. For t = 1, . . . ,T i: this vector may be computed by using the 
recursion 

f diag(A il )7r i if j = 1, 

lt \ diag(A,0[n w ] / Z i , t „ 1 otherwise, Uj 

where X it denotes the column vector with elements X itc , c = l,...,k. Once (7) has been 

computed for t = 1, . . . , T iy we obtain p(x) as the sum of the elements of Z^- 



4 Likelihood inference on the model parameters 

Let r\ denote the vector of all model parameters, i.e. the item parameters 0j and jj, the support 
points £ c of the latent distribution, the parameters <p c for the initial probabilities of the latent 
process and the transition probabilities nfj d . 

As usual, we assume that the subjects in the sample have response patterns independent 
from one another, so that the log-likelihood of the LMR model may be expressed as 

Kv) = 5^ lo gb(2/;)]> 

i 

where p(y i ) is computed as a function of r) by using recursion (7). We can maximise i(rj) by 
means of the EM algorithm (Dempster et al., 1977) implemented as in Bartolucci et al. (2007) 
and which we briefly illustrate in the following. 

4.1 Log-likelihood maximisation 

First of all, we need to consider the so-called complete data log-likelihood that may be expressed 
as 

n-n) = EiEe w *c iog(7r ic ) + Efi 2 E c £ d M^SH 

+ Et=i J2c w itc{vit log(Aite) + (i - va) iog(i - Xitc)}], 

where wu c is a dummy variable equal to 1 if subject i is in latent state c at occasion t (i.e. 
Cu = c), zfj d = Wij-ifiWitd is a dummy variable equal to 1 if subject % moves from the c-th to 



the d-th latent state at occasion t (i.e. C^t-i = c and i.e. Cu = d) and the probabilities in X itc 
are defined in (5). 

Obviously, the dummy variables in £*(r}) are not known and then this log-likelihood is 
exploited within the EM algorithm to maximize the incomplete data log-likelihood £{rj) by 
alternating the following two steps until convergence: 

• E-step: compute the conditional expected value of the dummy variables Wa c and Zid given 
the observed data y it , the covariates x it and the current estimate of 77; 

• M-step: update the estimate of 77 by maximizing the expected value of the complete 
log-likelihood £*(r}); this is defined as in (8) with each dummy variable involved in this 
expression substituted by the corresponding expected value computed at the E-step. 

The E-step may be performed by using certain recursions similar to (7). In order to udpate 
the parameters (3j, 7^ and £ c , the M-step requires to run an algorithm to maximize the weighted 
likelihood of a logistic model, which is easily available. A similar algorithm is necessary to 
update the parameters <f> c for the initial probabilities, whereas an explicit expression is available 
to update the transition probabilities irfj d . For a detailed description on how to implement these 
steps, see Bartolucci (2006), Bartolucci et al. (2007) and Bartolucci et al. (2008). 

The algorithm requires to be initialized by choosing suitable starting values for the pa- 
rameters in 77. Trying different sets of starting values is useful to detect multimodality of the 
likelihood, which often arises for latent variable models. These starting values may be gener- 
ated by a random rule and then we take, as ML estimate of the parameters, the value of 77 
which at convergence gives the highest value of £{rj). This estimate is denoted by 77. 

4.2 Model seleciton and testing hypotheses on the parameters 

Obviously, analysing a dataset through the model described above requires to choose the number 
of latent states k. Following the current literature on latent variable models, and the related 
literature on finite mixture models (see McLachlan and Peel, 2000), we suggest to rely on the 
Bayesian Information Criterion (BIC) which was proposed by Schwarz (1978). According to 
this criterion, we choose the value of k corresponding to the minimum of the index 

BIC k = -2£ k + g k \og(n), (9) 

where £ k denotes the maximised log-likelihood and g k is the corresponding number of parame- 
ters. 



Note that the penalization term in (9) only takes the sample size into account. In or- 
der to take into account the overall number of observations, corresponding to ^i^*; we can 
alternatively use the criterion BIC* based on the minimisation of the index 

BICZ = -2i k + g k log(%2Ti). 

i 

Once the number of latent states has been chosen, a hypothesis Hq on the parameters rj 
may be tested by the likelihood ratio (LR) statistic 

D = -2[£(f, ) - £(fi)], 

where r) is the ML estimate of r\ under Hq; this estimate may also be computed by using the 
EM algorithm illustrated above. 

When standard regularity conditions hold, the asymptotic null distribution of the statistic 
D is a chi-squared distribution with a number of degrees of freedom equal to the number of 
independent constraints used to express H . In particular, through this procedure we may 
test hypotheses on the transition probabilities of the latent process. As already mentioned, 
standard asymptotic results on the distribution of LR test statistic are not ensured to hold in 
this case. This happens, for instance, for the hypothesis that a transition matrix is diagonal 
and thus transitions between latent states is not possible. However, by using certain results 
on constrained statistical inference (Self and Liang, 1987; Silvapulle and Sen, 2004), it was 
shown by Bartolucci (2006) that the null asymptotic distribution of D is of chi-bar-squared 
type (Shapiro, 1988; Silvapulle and Sen, 2004). This is a mixture of standard chi-squared 
distribution with weights that may be computed by a simple rule. 

5 Results 

In this section we investigate models that describe the developmental and dimensional aspects of 
the data collected to measure inhibitory control and attentional flexibility, described in Section 
2. The tasks that measure the above abilities correspond to J = 4 types of item, namely: (i) 
day/night, (ii) abstract pattern, (iii) DCCS face-down and (iv) DCCS face-up. As we described 
in detail in Section 3, our modelling approach relies on a latent Markov process. The parameters 
of the distribution of this process are specified as follows: 

(i) the initial probabilities of the latent process depend on age according to (4); 



(ii) there are 8 different transition matrices of the latent process that model participants' 
ability transition during the period of experimentation. The experimental stages of sub- 
stantive interest and their corresponding transition probability matrices are depicted in 
Figure 1. Each testing session in the study involves two transition matrices. The first 
matrix measures changes in performance within the inhibitory control sequence of trials 
and transition to the attentional flexibility task, whereas the second one measures tran- 
sitions within the attentional flexibility sequence of trials. Thus, changes in performance 
during the two types of testing sessions yield four transition matrices, Hi and n 3 for 
the experimental design where harder tasks precede easier tasks, and II2 and II4 for the 
alternative experimental design. Transition matrices II5 and 116 represent changes in 
abilities between sequences over a week period of time, and similarly, matrices II7 and 
II 8 model changes over six-month time periods for the two experimental designs. 

To complete our model specification we initially assume that the probability of responding 
correctly to each item is unconstrained, so that we have a specific parameter Xj C for the prob- 
ability of responding correctly to each item of type j, j — 1, . . . , J, given each possible latent 
state c, c = 1, . . . , k. Subsequently we test whether the 2PL parameterisation based on (1) is 
reasonable. 

We next turn to the problem of choosing an LM model that provides an appropriate rep- 
resentation of the dependence structure. We first address the choice of the number of latent 
classes. We consider four models with 1 to 4 latent classes and select the fitted model with 
smallest values of the Bayesian information criteria BIC and BIC* where, as shown earlier, BIC 
is penalised by a term equal to the sample size 115 and BIC* is penalised by the total number of 
single trials equal to 12798. Table 1 displays the values of the log-likelihood function evaluated 
at the maximum likelihood estimate, the BIC and BIC* indices for model comparison. 

On the basis of the results in Table 1, we select a model with k = 3 latent states. This 
model may be simplified by testing hypotheses on the equality between some of the transition 
matrices deriving from the experimental stages depicted in Figure 1. For instance, it is of 
interest to test whether patterns of participants' performance within each of the two tasks of 
inhibitory control are comparable. Formally, this involves testing the hypothesis of equality 
between the transition matrices Hi and n 2 . Similarly, comparison between matrices II5 and 
Hq and matrices II7 and Ilg is important because rejection of the hypothesis of equality of 
these matrices would suggest the presence of task order effects. 



Table 1: Comparison of models with various values of k by maximum log-likelihood, BIC and 
BIC* indices 



k 


log-likelihood 


no. of par. 


BIC 


BIC* 


1 


-7050.1 


4 


14119.0 


14138.0 


2 


-3907.5 


26 


7938.4 


8060.9 


3 


-3545.3 


64 


7394.3 


7695.9 


4 


-3461.1 


118 


7482.1 


8038.1 



Note: No. of par. is the number of parameters; BIC=Baycsian Information Criterion with penalization term 
equal to the sample size 115; BIC*=BIC with extended penalization term to the total number of single trials 
12798. 



Table 2 summarises the results of testing the above and other relevant hypotheses on the 
transition matrices. We highlight in boldface those constrained models for which the BIC and 
BIC* indices are minimum among fitted models with comparable number of parameters. 



Table 2: Hypotheses testing on the transition matrices that represent changes in participants' 



performance during the experimental stages depicted in Figure 1 



Hypothesis 






log-likelihood 


No. par. 


BIC 


BIC* 


n x = n 2 






-3556.3 


58 


7387.9 


7661.2 


n 3 = n 4 






-3557.2 


58 


7389.6 


7662.9 


n 5 = n 6 






-3555.2 


58 


7385.6 


7658.9 


n 7 = n 8 






-3555 


58 


7385.3 


7658.6 


rix = n 2 , n 5 = 


n 6 




-3569.8 


52 


7386.3 


7631.3 


n 3 = n 4 , n 5 = 


n 6 




-3562.2 


52 


7371.1 


7616.1 


n 5 = n 6 , n 7 = 


n 8 




-3554.8 


52 


7356.4 


7601.4 


rix = n 2 , n 5 = 


n 6 , n 7 = 


n 8 


-3573.5 


46 


7365.2 


7582 


n 3 = n 4 , n 5 = 


= n 6 , n 7 


= n 8 


-3566.7 


46 


7351.6 


7568.3 


n x = n 2 , n 3 = 


n 4 , n 5 = 


n 6 , n 7 = n 8 


-3572.2 


40 


7334.2 


7522.7 


rix = n 2 = i 3 






-4014 


34 


8189.2 


8349.4 


n 3 = n 4 = J 3 






-3709.9 


34 


7581.1 


7741.3 


n 5 = n 6 = z 3 






-3641.5 


34 


7444.3 


7604.5 


n 7 = n 8 = i 3 






-3656.6 


34 


7474.6 


7634.8 



Note: No. par.= Number of parameters; BIC=Baycsian Information Criterion with penalization term equal 



to the sample size 115; BIC*=BIC with extended penalization term to the total number of single trials 
12798. Constrained models for which BIC and BIC* are minimum among fitted models with comparable 
number of parameters are highlighted in boldface; J 3 = 3 x 3 identity matrix. 



Based on the results presented in Table 2, we adopt a constrained model where the matrices 
Tlx and n 2 are equal and so are matrices n 5 and II 6 , and n 7 and II 8 . Further reduction 
is not plausible as can be seen from the fact that constraining the transition matrices to be 
identity matrices yields larger values of BIC and BIC*. Hence, we can represent the depen- 
dence structure through four transition matrices. The first two transition matrices comprise 
patterns of performance within the inhibitory control and within the attention flexibility items 
respectively. The third transition matrix yields information on changes in performance after a 
week's time; the fourth transition matrix describes developmental changes that take place in 6 
months' time. 

Furthermore, we tested the dependence of the initial probabilities on participant's age. We 
observe that the value of the BIC increases from 7334.2 to 7446.4 when the dependence is 
removed. Thus we conclude that age has a significant effect on the initial probabilities. 

Another important aspect in the search of an appropriate latent Markov model concerns 
the parameterisation of the conditional probabilities of success at performing the task given 
the latent state. We explore the 1PL and 2PL parameterisations, contrasting between the uni- 
dimensional and the bidimensional cases; see equations (1) and (3). We adopt a bidimensional 
2PL model which yields the smallest values of BIC compared with its competing models, as 
can be seen from Table 3. The first ability underlies the inhibitory control tasks, whereas the 
second ability is the basis of the attentional flexibility tasks. This completes our model choice. 

Table 3: Model comparison for unidimensional and bidimensional parameterisations of the 
Rasch and the 2PL logistic models 



Model 


log-likelihood 


No. par. 


BIC 


BIC* 


Rasch unidimensional 


-3696.0 


34 


7553.2 


7713.4 


Rasch bidimensional 


-3580.6 


36 


7332.0 


7501.6 


2PL unidimensional 


-3679.8 


37 


7535.2 


7709.6 


2PL bidimensional 


-3572.3 


38 


7325.0 


7504.0 



Note: 2PL=two-parameter logistic model; No. par.= Number of parameters; BIC=Bayesian Information 
Criterion with penalization term equal to the sample size 115; BIC*=BIC with extended penalization 
term to the total number of single trials 12798. 

In summary, we adopt a latent Markov model with three latent variables. The initial 
probabilities of the latent process depend on participant's age and there are four transition 
probability matrices whose interpretation was discussed earlier. For this model, the conditional 



probability of responding correctly to each item given the latent process is parameterised as in 
a 2PL bidimensional model Bartolucci (2007). Here, the first dimension represents inhibitory 
control and the second one attentional flexibility. Under the selected model the estimated 
ability parameters are summarised in Table 4. 



Table 4: Ability parameter estimates for each latent state 



Dimension 


latent state 1 


latent state 2 


latent state 3 


Inhibitory control 


-5.454 


0.040 


5.050 


Attentional flexibility 


0.145 


-35.145 


6.176 



The latent states are ordered to reflect lowest to highest ability for both dimensions. Note 
that the estimated ability parameters corresponding to attentional flexibility are large in ab- 
solute value due to the fact that a large number of participants tended to score either zero 
or one during the full 6-item sequences. As noted by the experimenter, this is not a particu- 
lar characteristic of this data set; but other studies (e.g. Zelazo et a!., 1996) report a similar 
phenomenon. 

Table 5 displays estimates of the discriminant and difficulty parameters associated with 
each item. These estimates reveal that the abstract pattern task is considerably easier and less 
discriminant than the day/night task. On the other hand, the two DCCS tasks are comparable 
in level of difficultly, but the DCCS face-up task discriminates better than the face-down version 
of this task. These results confirm the experimenter's hypotheses (see Shimmon, 2004), namely 
that participants would perform better at the abstract pattern and the DCCS face-up tasks 
than at their counterparts. Furthermore, in the case of the inhibitory control tasks, the abstract 
pattern task was regarded as a control for the day/night task, in the sense that it is expected 
to make less inhibitory demands; therefore, our finding that the day/night task is a better 
discriminator is reassuring. 

Based on the estimates reported in Tables 4 and 5 we can calculate the conditional proba- 
bilities of success to respond to an item for a given participant given his/her latent state; see 
equation (1). These probabilities are presented in Table 6. 

A noteworthy fact is that the participants who fall in the latent state 1 have a fifty percent 
probability of success in the DCCS tasks but low probabilities of success in the day/night task. 
The interpretation of the ability on the DCCS tasks for such participants must be made with 



Table 5: Discriminant and difficulty parameter estimates associated with each item 





discriminant 


difficulty 


Item 


index 


level 


day/night 


1.000 


0.000 


abstract pattern 


0.610 


-4.880 


DCCS face-down 


1.000 


0.000 


DCCS face-up 


2.744 


0.107 



Table 6: Estimated Conditional probabilities of responding correctly to each item given latent 
state 



Item 


latent state 1 


latent state 2 


latent state 3 


day/night 


0.0043 


0.5101 


0.9936 


abstract pattern 


0.4132 


0.9527 


0.9977 


DCCS face-down 


0.5362 


0.0000 


0.9979 


DCCS face-up 


0.5264 


0.0000 


1.0000 



care, since the experimenter has reported that she believes that some participants adopted a 
strategy, i.e. alternating the card placement regardless of the game's rules, to perform the card 
tasks that led to a medium size number of successes in the 6-trial sequences. Importantly, 
our analysis strongly supports the hypothesis that there are two separable abilities, namely 
inhibitory control and attentional flexibility, underlying performance at the items administered 
to participants. We next address developmental questions regarding these abilities by exploring 
the transition probabilities of the assumed latent process. 

We examine the effect of age on performance by modelling the initial probabilities of the 
latent process in a logistic regression with age as explanatory variable. The estimates of the 
regression coefficients of age, corresponding to the logistic curves of the initial probabilities 
for the second and third latent states, are 0.111 and 0.361 respectively. That is, the model 
classifies older participants into states of higher performance. Averaged over all the subjects in 
the sample, the three latent states have initial probabilities equal to 0.261, 0.360 and 0.379. The 
second and third state have comparable dimension, whereas the first is smaller and contains 
around one fourth of the sample. 

Table 7 reports the estimated transition probabilities matrices. Recall that earlier in this 
section we identified four transition matrices to represent the change patterns on participants' 



performance. The matrix denoted by "within IC items" in Table 7 encompasses changes in 
performance within the inhibitory control items and transitions to the attention flexibility items. 
It corresponds to IIi = II2. The "within AF items" represents performance changes within 
the attention flexibility items only and corresponds to II3 = II4. The remaining two transition 
matrices represent transition of states after a week and six-month periods respectively, that is 

n 5 = n 6 and n 7 = n 8 . 



Table 7: Estimated transition probability matrices 




within IC items 


within AF items 


latent state 


1 2 3 


1 2 3 


1 


0.9809 0.0191 0.0000 


0.5883 0.2211 0.1905 


2 


0.0228 0.9146 0.0626 


0.0049 0.9951 0.0000 


3 


0.0114 0.0372 0.9514 


0.0226 0.0022 0.9753 




between AF and IC 


between AF and IC 




a week later 


6 months later 


latent state 


1 2 3 


1 2 3 


1 


0.0000 0.5763 0.4237 


0.3002 0.6997 0.0000 


2 


0.0609 0.0000 0.9391 


0.1795 0.1165 0.7040 


3 


0.0000 0.0000 1.0000 


0.1221 0.1362 0.7416 



Note: IC=inhibitory control; AF=attentional flexibility. 



The within IC items transition matrix show evidence of a tiring effect as reflected by some 
tendency of participants to move from higher to lower states within the sequences of inhibitory 
control trials. In contrast, the within AF items transition matrix primarily indicates the pres- 
ence of learning effects as participants tend to move from the first to the higher states. Simi- 
larly, there is a learning effect during the transition after a week's time. Finally, the probability 
matrix concerning transitions over a six-month period shows strong developmental effects for 
participants in states 1 and 2. There are also some evidence of participants moving back from 
the second and third states to lower states. 

A final point concerns the computation effort involved in the analysis based on the LM model 
described in this paper. We found that, although estimation is based on the EM algorithm 
which may require a large number of iterations, fitting the model on the data described above 
is rather fast even with a large number of latent states. This is because we use an efficient 
implementation based on recursions taken from the hidden-Markov literature. Implementing 
these recursions is rather easy in mathematical packages such as Matlab. On the other hand, 
we found that for the data considered here, most of the fitted models have a multimodal 
likelihood. It is then crucial to try different starting values for the EM algorithm as described 



at the end of Section 4.1. We make our Matlab implementation of the EM algorithm, and 
the related routine to choose the starting values, available to the readers upon request. 

6 Discussion 

We have developed an approach based on a multidimensional latent Markov model suitable for 
the analysis of long binary response sequences that measure two or more latent traits. 

For each subject in the sample, our approach relies on Markov chains with transition prob- 
abilities that are of substantive interest, in order to measure dynamic changes in the responses 
during repeated trials and over fixed periods of time. Hence the applicability of the method 
lies on studies where the aim is at studying developmental or detrimental changes over time 
in an intuitive way. Importantly, we show how to test the dimensionality of the problem by 
developing valid LR tests. 

The modelling approach has been illustrated through the analysis of data from a develop- 
mental study. Throughout the statistical analysis we emphasized the importance of taking into 
consideration experimental aspects of the study, for instance to account for potential order- 
ing effects. Our main finding was evidence that supports that the tasks measure two distinct 
psychological constructs, namely inhibitory control and attentional flexibility. An obvious ex- 
tension of the applicability of our method would be to investigate hypotheses on the relationship 
between other executive skills such as working memory and planning. 

In addition, we were able to demonstrate developmental aspects of the two abilities. In 
particular that these abilities develop at an early age, as the covariate age played an important 
role at performance. The dynamics within sequences were different among the two types of tasks 
considered. The presence of tiring effects was clear within the inhibitory control sequences. In 
contrast, participants where either able or unable to perform the attentional flexibility task. It 
is plausible that a third group showed a spurious medium probability of success at this task. 
Further investigation on the nature of this task is warranted. 

A limitation of our approach is the assumption of a first order Markov chain for the latent 
process. However, this assumption seems to be suitable for the data analysed in this arti- 
cle. Indeed, a quick inspection of time dependence through the assumption of a continuous 
latent process favours a short-range dependence. Note also, that we do not require to make 
any distributional assumptions on the latent process as opposed to the conventional Gaussian 
assumption made for continuous processes. 
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